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NORMALIZED RANDOM MEASURES DRIVEN BY INCREASING 

ADDITIVE PROCESSES 
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Walker 3 

ITAM-Mexico, Universita degli Studi di Pavia and University of Kent 

This paper introduces and studies a new class of nonparamet- 
ric prior distributions. Random probability distribution functions are 
constructed via normalization of random measures driven by increas- 
ing additive processes. In particular, we present results for the dis- 
tribution of means under both prior and posterior conditions and, 
via the use of strategic latent variables, undertake a full Bayesian 
analysis. Our class of priors includes the well-known and widely used 
mixture of a Dirichlet process. 

1. Introduction. This paper considers the problem of constructing a 
stochastic process, defined on the real line, which has sample paths be- 
having almost surely (a.s.) as a probability distribution function (d.f.). The 
law governing the process acts as a prior in Bayesian nonparametric prob- 
lems. One popular idea is to take the random probability d.f. as a normal- 
ized increasing process F(t) = Z(t)/Z, where Z = lim^oo Z(t) < +oo (a.s.). 
For example, the Dirichlet process [Ferguson (1973)] arises when Z is a 
suitably reparameterized gamma process. We consider the case of normal- 
ized random d.f.s driven by an increasing additive process (LAP) L, that is, 
Z(t) = J k(t, x) dL{x) and provide regularity conditions on k and L to ensure 
F is a random probability d.f. (a.s.). 

This paper represents a natural development of the work of Regazzini, 
Lijoi and Prunster (2003). These authors introduce the class of normalized 
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random d.f.s with independent increments (RMI), a particular case of IAP 
driven random d.f.s, and consider the problem of determining the exact dis- 
tribution of means of a normalized RMI. The study of means of random prob- 
ability d.f.s has become, after the pioneering work of Cifarelli and Regazzini 
(1979, 1990), a very active area of research, touching on both analytical 
and simulation based approaches. We mention, among others, Muliere and 
Tardella (1998), Guglielmi and Tweedie (2001), Regazzini, Guglielmi and 
Di Nunno (2002) and Lijoi and Regazzini (2004) for results in the Dirichlet 
case and Epifani, Lijoi and Priinster (2003), Hjort (2003) and Regazzini, 
Lijoi and Priinster (2003) for results beyond the Dirichlet case. 

In this paper we also consider both analytical and simulation based ap- 
proaches to the study of means, providing a comprehensive treatment of the 
subject. By extending the methodology proposed in Regazzini, Lijoi and 
Priinster (2003) to our more general case, we determine the exact law of any 
mean of a normalized IAP driven random d.f. This approach exploits Gur- 
land's inversion formula and gives expressions for the posterior distributions 
in terms of the Liouville-Weyl fractional integral. 

An important class of normalized IAP driven random d.f.s is obtained 
if limt— >ao k(t, x) is a constant for all x. Then we obtain the class of mix- 
tures of normalized RMI, that is, F(t) = J k(t, x) dG{x). Moreover, if G is a 
Dirichlet process we have a mixture of a Dirichlet process, first introduced 
by Lo (1984). This family was the focus of much attention during the 1990s 
as a consequence of the introduction of simulation based inference, first con- 
sidered by Escobar (1988) in his Ph.D. dissertation, and later developed 
by Escobar and West (1995) and MacEachern and Miiller (1998), among 
others. The model is comprehensively reviewed in the book edited by Dey, 
Miiller and Sinha (1998). By exploiting the above mentioned general results, 
we are able to give exact prior and posterior distributions for any mean of a 
mixture of a Dirichlet prior and, furthermore, provide a new simulation al- 
gorithm. Finally, we illustrate our results both theoretically and numerically 
by applying them to what we call a Dirichlet driven random probability d.f. 

Before proceeding we introduce the fundamental concepts and tools for 
the paper. Let L := {L{y) :y > 0} be any IAP defined on (fi, J 7 , P). In gen- 
eral, an IAP can be expressed as 

(1) L(y)= E L{rj} + L c (y), 

j-Tj<y 

where M = {ti,T2, •■•} is the set of fixed points of discontinuity and L c 
is the part of the process without fixed points of discontinuity. Hence L 
is characterized by the density functions of the jumps {L{t\}, L{t2}, ■ . .}, 
indicated by {f T1 , f T2 , . . .}, and the family of Levy measures, {v : u(y, dv),y > 
0}, related to L c through the celebrated Levy-Khintchine representation. 
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For an exhaustive account of the theory of IAPs see, for example, Sato 
(1999). 

Consider any nondegenerate measure a on such that a($L) = a € 

(0, +oo) and denote by A the corresponding d.f. The time change y = A{x) 
yields an a.s. [P] finite IAP La = {La(x) :i£l} uniquely determined by 
the family of Levy measures {u a : u a (x, dv) = u(G~ 1 (x),dv),x £ R}, where 
G a {y) := vai{x:A{x) > y} for y € (0, a(R)). Its Laplace transform is thus 
given by 



r r°° 

E[e' XLA ^]=exp - {1 - e- Xv }v a {x,dv) 
L ■'0 



for any A > 0. 



In the following it will be convenient to characterize La in terms of its Pois- 
son intensity measure, indicated by u a , instead of its corresponding family 
of Levy measures. Recall that D a ((—oo,x] x C) = v a (x,C) for every i£l 
and Ce^((0,+oc)). 

Consider now the stochastic process given by a convolution of La with 



Z:=\z{t)= f k{t,x)dL A {x):t£ 



Suppose k and v a satisfy simultaneously the following conditions: 

(I) 1 1— > k(t, x) is nondecreasing and right continuous with lim^-oo k(t, x) = 
for every x € R; 

(II) Jkx(o+oo)[1 — ex P{ — Xvk(x)}]i> a (dx dv) < +oo for every A > 0, where 

k(x) :=\im t ^ +OQ k(t,x); 
(III) D a (R x (0,+oo)) = +oo. 

Then Z is a random d.f. a.s. [P] and F = {F(t) = Z(t)/Z : t € M} is a random 
probability d.f. a.s. [P], having set Z := limj^+oo Z(t). For details about the 
determination of conditions (I)— (III) , refer to the Appendix. 

In this context, according to Barndorff-Nielsen and Shephard (2001), La 
can be seen as a background driving IAP. Hence, Z and F will be called an 
IAP driven random d.f. and a normalized IAP driven random d.f., respec- 
tively. There are now a number of works based on Levy driven processes; 
we mention Wolpert, Ickstadt and Hansen (2003) and Brockwell (2001) who 
introduce a Levy driven CARMA model. By choosing k(t,x) = ^(-oo,t]( x ) a 
normalized IAP driven random d.f. reduces to a normalized RMI, whose 
trajectories are discrete a.s. [P]. This property of normalized RMI may 
be undesirable in many situations. It is easily seen that a normalized IAP 
driven random d.f. has absolutely continuous sample paths with respect to 
the Lebesgue measure on R a.s. [P] if and only if 1 1— > k(t,x) is absolutely 
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continuous for every x £ K. If this is the case, the corresponding normalized 
IAP driven random density function is given by 

/(*) = kk'{t,x)_dL A {x) (i g R) ^ [p]) 

where k'(t, x) := Jj/c(t, x). In the following we will always assume F to admit 
a density. 

In Section 2 we derive the exact distributions of means of normalized IAP 
driven random measures under prior and posterior conditions and derive 
distributional results for means of normalized gamma driven random d.f.s 
and, in particular, for the mixture of a Dirichlet process. In Section 3 a 
sampling strategy for drawing samples from the posterior distribution of F 
is presented, and we provide a numerical illustration. All proofs are deferred 
to the Appendix. 

2. Distribution of means of normalized IAP driven d.f.s. In this section 
we are concerned with the problem of determining the prior and posterior 
distribution of means of normalized IAP driven random d.f.s, extending the 
results of Regazzini, Lijoi and Priinster (2003) (RLP). 

2.1. Existence and distribution of means. First of all we need to estab- 
lish the existence of f R g(t) dF(t), or equivalently of J R g(t) dZ(t). Suppose 
Jr IffWI dZ(t) < +co a.s. [P]. By application of Fubini's theorem, 

/ g(t)dZ(t)= f g(t) f k'(t,x)dL A (x)dt 
JR Jr Jr 

(2) 

= / h{x)dLA{x) a.s. [P], 
Jr 

where h(x) = f R g(t)k' (t,x) dt. Hence, a linear functional of an IAP driven 
random d.f. can be expressed as another linear functional of an IAP, which 
actually reduces our problem to the one considered by RLP. In terms of exis- 
tence, the previous relation guarantees that J R \g(t)\ dF(t) = f R h(x) dL-A(x) 
a.s. [P], having set h(x) = J K \g(t)\k' (t,x) dt. Thus, by a slight modification 
of Proposition 1 in RLP, we have the required necessary and sufficient con- 
dition. 

Proposition 1. Let F be any normalized IAP driven random d.f. and 
let g be any measurable function g:M— >• R. Set h{x) := f^\g(t)\k' (t,x) dt. 
Thenf R \g(t)\dF(t) < +oo a.s. [P] if and only if f Rxi0+oo) [l-exp(-Xvh(x))} x 
v a {dxdv) < +oo holds for every A > 0. 
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We now proceed to determine the probability distribution of f^g{t) dF(t). 
Assuming the conditions of Proposition 1 hold, we observe that for any 

g{t)dF(t)<o^=P^J {h(x)-ak(x)}dL A (x)<oj, 

where h(x) := J R g(t)k'(t,x)dt and k(x) := lim^ +00 k(t, x). Hence, we are 
able to extend Proposition 2 in RLP, which is based on the inversion formula 
given in Gurland (1948), to our more general case, with obvious modifica- 
tions. 



Proposition 2. Let F be a normalized IAP driven random d.f., let F 
be the probability d.f. of f R g(t) dF(t) and set h(x) = f^g(t)k' (t,x) dt. For 
every a £ R, we have 

= lim f -exp{ / \cos(sv(h(x) — ak(x))) — l]u a (dxdv) 

2 7rrT+ooJo s Uix(o,+oo) 

x sin / s'm{sv(h(x) — ak(x))}i/ a (dx dv) I ds. 

\JRx(0,+oo) / 

2.2. Posterior distribution of means. Here we aim at providing expres- 
sions for the posterior distribution of means of normalized IAP driven ran- 
dom d.f.s. This is done by introducing an appropriate sequence of nested 
partitions and by discretizing F through the discretization of both k and 
the space of observations. This construction guarantees the discretized pos- 
terior distribution of the mean to determine uniquely the limiting one, by 
a.s. convergence in distribution. Hence, we give an explicit expression for 
the posterior density of the discretized mean, which can be used as an ap- 
proximation of the limiting one. In certain cases, once the Levy measure is 
specified, it is also possible to derive an explicit representation of the limiting 
distribution. 

Assume that (fi, F, P) also supports a sequence T = (T n ) n >\ of exchange- 
able random variables. The first step consists in discretizing F. To this 
end, let us introduce a sequence of partitions (V m ) m >i of R, where V m := 
{A mj i :i = 0, . . . , k m + 1}, which satisfy the following properties: 

(a) Vm+i is a refinement of V m - 

(b) ^(R) is generated by U m >i a ^Pm)i where a(V m ) denotes the o- 
algebra generated by V m . 

(c) e rn := 2maxi<j< fcm diam(^ mji ) J, (as m -> +oo). 
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(d) A m .O — ( — OO, — R m ) , A m ^ — \t m ,ii tm,i+l) f° r * — 1, • ■ • , k m — 1, Am,k m — 
[tm,k m ) tm,km+l] 1 an d A m km+ i = (R m , +00), With t m> i = -R m , t m ,k m +l = Rrn 

and R m > for any m > 1. 

Now we have to select points a mj j in A m ,i for i = I, . . . ,k m and put a m> o = 
— i2 m and a m ,k m +i — ^m- Whenever the rth element, T r , in the sample lies in 
A m ,i, it is as if we had observed a m ,i- The discretized random d.f. is defined 
as 

(3) F m (i):= 2 ^ [fc(t ^ +1 ' x) ~|(W^)]^a(x) foreveryteM 

with the conventions k(t m> o,x) = and k(t m ,k m +2,x) = lim^ +00 x) = 
k(x). 

Set T n = (Ti, . . . ,T n ) and denote by /i n its distribution. The prior proba- 
bility d.f. of a mean J g(t) dF m (t) is denoted by F m (-; g) and its posterior d.f. 
by ¥ m ,t n ('',g)- Having (3) at hand, it is easy to verify that the approximation 
result given in Proposition 4 of RLP holds true also in this quite different 
setup. Hence, for every a belonging to the set of continuity points of F^ , 

(4) lim F* (a;g)=¥ t n(a;g) a.s. [//*]. 



m— »+oo 



Having derived F m according to Proposition 2, one can see that Proposition 3 
in RLP extends also to our more general case. Thus, supposing (a, b) is an 
interval containing all the g(a m ,i)'s and assuming interchangeability of the 
derivative with the integral, one has that the posterior density function 
of / g(t) dF m {t) , given T n = t n with ri{ v > terms set equal to a m ,i p (p = 
l,...,q) such that J2 P n i p = n i 1S given by 

(-1)" 



(5) Qn 

nU^+^mi^ro, • • • ,r k+ i) 



(ro,-..,r k +i)=(g(a m ,o),---,g(am,k+i)) 



where I™ + h(a) = f° ^°^ n "^i h{u) du is the Liouville-Weyl fractional integral, 
for n > 1, and I° + represents the identity operator. 

2.3. Normalized gamma and mixtures of Dirichlet process. Many Bayesian 
nonparametric priors are constructed via transformations of gamma pro- 
cesses. Hence, it seems natural to focus attention on normalized gamma 
driven random d.f.s. Here, a complete treatment of the distributional prop- 
erties of means of normalized gamma driven random d.f.s is provided. 

Before proceeding, let us briefly recall that a reparameterized gamma 
process, Ta, is characterized by a Poisson intensity measure of the type 
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i> a (dxdv) := e~ v v~ 1 dvda(x), while the extended gamma process, T^, intro- 
duced by Dykstra and Laud (1981), is characterized by a Poisson intensity 
measure of the type v a (dxdv) := e~ /3 ^ v v~ 1 dvda(x), where (3 is a nonneg- 
ative piecewise continuous function. These two IAPs are connected to the 
normalized gamma driven random d.f. and, in particular, to the mixture of 
Dirichlet process (MDP) through the following relations. 

Proposition 3. Suppose F is a normalized IAP driven random d.f. 
Then: 

(i) // La = F can be represented as a normalized gamma driven 
random d.f.s, that is, 

J R k(t,x)dT A (x) _ J^xmx^dTAix) 



J R ~k(x)dT A (x) ' J R k(x)(p(x))-UT A (x) 



a.s. [P]. 



(ii) If La = Fa, F can be represented as a mixture of a normalized ex- 
tended gamma process, that is, 



f R k(t,x) dTA(x) _ f k(t,x) dT l A k {x) 



f R k{x)dT A {x) Jm k(x) fVfc 



a.s. [P], 



having set T^( k := lim x ^+oo Y l A k (x). Moreover, if k(x) = b~ 1 , then F is an 
MDP, f R bk(t, x) d^A{x) , where *3 'a denotes the Dirichlet random d.f. 

(hi) If La = ^a, any mean of F, provided it exists, may be represented 
as a mean of a normalized extended gamma process, that is, 

g(t)dF(t)= h(x)^-U a.s. [P], 

JR f / 

withh(x) = (A;(x)) _1 f R g(t)k' (t,x) dt. If, moreover, k(x) = b~ 1 , then f R g(t) dF(t) 
becomes a mean of a Dirichlet process, f R bh(x) d^A(x) , where h(x) = f R g(t)k'(t,x 

Thus, we have that a normalized extended gamma driven random d.f. 
is equivalent to a normalized gamma driven random d.f. and that MDPs 
are a special case. Nonetheless, in studying means of normalized gamma 
driven random d.f.s we confine ourselves to MDPs, because to date nothing 
is known about exact distributions of their means. This is done without loss 
of generality, since the following results are easily extended to any normalized 
gamma driven random d.f. 

With reference to existence of a mean of an MDP, f R g(t) dF(t), by Propo- 
sition 3, the condition reduces to the well-known 

(6) / log(l + \\h{x)\)a(dx) < +oo for every A > 0, 
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with h(x) = J R g(t)k'(t,x) dt as previously. See Feigin and Tweedie (1989) 
and Cifarelli and Regazzini (1990, 1996). Consequently the d.f. of a mean 
Jjsi.9(t)dF(t) is given by 



(7) 



F(cj) = / - exp - / log{l + s 2 {h{x) - af}a{dx) 

2 IT Jo s I it 

x sm ^y arctan[s(/i(x) — a)]a(dx)j ds. 



The fact that our mean is just another mean with respect to the Dirich- 
let process implies that its law is absolutely continuous with respect to the 
Lebesgue measure. See Regazzini, Guglielmi and Di Nunno (2002) for ex- 
pressions of the corresponding density function. 

We now move on in stating our main result, which provides intuitive 
insight into the mixing character of the posterior behavior of means of MDPs. 

Theorem 1. Suppose F is an MDP and its mean f™g(t) dF(t) exists, 
that is, g satisfies (6). Then its posterior distribution, given T n = t n , is 
absolutely continuous {with respect to the Lebesgue measure on M) and a 
posterior probability density function is given by 



(8) Pt „( a )= J p u n(a)G(dui,...,du n \t n ), 

where 

G(du\, . . . ,du n \t 



YYj=i k'(tj,Uj)a n (dui, . . . , du r , 



/m» Wj=i k'{tj,Uj)a n {dui, du n ) 

represents the distribution of the latent variables U n , given the observations 
T n = t n , with a n defined as the n-fold product measure rifc=i( a + 2~2i=i ^uj, 
and p u n denotes the posterior distribution of f R h(x) d&A{x), given U n = u n , 
with h(x) = f R g(t)k'(t,x)dt, and given by 

Pu n {o~) = —J Refexpj — J log[l + is(h(x) — a)]a*(dx)^^j ds, 

having set a* = a + Ya=i $Ui ■ 

A deficiency of the previous intuitive result is represented by the dimen- 
sion of the integration region in (8), which grows as the sample size grows. 
This can be overcome by an application of Lemma 2 in Lo (1984), which 
essentially allows one to account for coincidences within the latent observa- 
tions. To this end let us introduce some notation. Denote by V := {d :i = 
1, . . . , N(V)} a partition of {1, 2, . . . , n}, where N(V) indicates the number 
of cells and Cj the ith cell in the partition. Moreover, let q be the number 
of elements in C; . 
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Corollary 1. Suppose J R g(t) dF(t) is a mean of an MDP and g sat- 
isfies (6). Then its posterior density function, given T n = t n , is given by 



where p u ^i denotes the posterior density of L h(x) d^A(x), given C{ obser- 
vations equal to u. 

It is worth pointing out that the burden involved in posterior densities, 
when dealing with more than a few observations, becomes overwhelming 
for currently available computational tools. The necessity of a simulation 
algorithm is evident. 

3. Posterior simulation. In this section we provide a method to sample 
from the posterior distribution of F, and /, given a set of n observations 
T n . The algorithm depends on the strategic and novel introduction of latent 
variables. Let S and U be latent variables, and consider the joint distribution 

p(t, s, u\La) = exp(— uZ )k'(t, s) dLA(s), u>0,s£l, 

where, as previously, La is a reparameterized IAP and k'(t,s) = -j^k(t,s). 

Note that La is a pure jump process and so the support of s will be 
the location of the jumps of La, that is, p(s\t,u, La) oc LA{s}k' (t, s) and 
La{s} = La{s) — La(s-). Clearly p(t\LA) = f(t), as required. 

Having established the general sampling strategy, let us consider, in par- 
ticular, normalized gamma driven random d.f.s. For computational reasons, 
we allow La to have fixed points of discontinuity. Recall the representation 
of such an IAP given in (1) together with the related notation. We work 
with a normalized extended gamma driven random d.f. which we know to 
be equivalent to a normalized gamma driven random d.f. by Proposition 3. 
Some other authors have obtained posterior distributions when working with 
additive processes in different contexts [see, e.g., Dykstra and Laud (1981), 
Hjort (1990), Walker and Muliere (1997) and Nieto-Barajas and Walker 
(2004)]. Let us start with a single observation T\\ then we obtain the follow- 
ing result, where Q denotes a gamma distribution. 

Proposition 4. Let F(t) = Z(t)/Z(T) be a normalized IAP driven ran- 
dom measure, where T is the maximum time up to where the process is ob- 
served and T\ is a random sample from F. Denote by M the set of prior 
fixed points of discontinuity of La and by * an updated parameter / function. 



(i) Given T\ = t\, S\ = si € M and U\ =u\, the posterior parameters 
are M* = M, 



Ey(Ui=i [(cj - l)!/R/Qu e <(g)IIpeCi fc/ ( t P»") Q! W]) 
EpttliST [(« - 1)! JmiW, k'(t p ,u)a(du)}) 




ifTj = Si, 

ifTj + Si,Tj < T. 
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(ii) Given T\ = t\, S\ = s\ M and U\ =u\, the posterior parameters 
are M* = M U {si} ; with 

fsi(x) = G(x\l,P(s 1 ) + uik(T } s 1 )), 

f*(x)^e-^ r ^f T .(x) ifTj<T. 

Furthermore, given T\ = t\ and U\ = u\ , the posterior distribution for the 
continuous part L C A {-) is L c A (s) ~ F^{a(s), /?*(s)} ; where (3*(s) = (3(s) + 
uik(T,s). Thus, the posterior distribution of the normalized random mea- 
sure is F*(t) = Z*(t)/Z* with Z*(t) = J k(t,x) dL\(x). 

Proposition 4 also holds for T = oo. However, for simulation purposes we 
need to truncate at T. Given this result, posterior simulation becomes quite 
straightforward. For n observations we have 

n 

p(t n , s n , u n \L A ) = J] exp(-n i Z )k'(t i ,s i ) dL A ( Si ). 
i=l 

Given La, sampling from p(si\t n , u n , La) and p(ui\t n , s n , La) is trivial, and 
given (t n ,s n ,u n ), the conditional posterior of La remains an additive pro- 
cess. We will need to implement a Gibbs sampler in the following way. As- 
suming that M = 0, then initiate the algorithm by generating Uj ~ C/(l, 1), 

11 ~Z//(0,tj) for i = 1, . . . ,n, where U denotes the uniform distribution. For 
iterations h = 1, . . . , H do: 

1. Generate L A from p(L^|t n ,s n<h 1) ,u n(h X> ) with the following specifi- 
cations: 

(a) The Levy measure is given by 

i' ayS (dv) = dv / v~ l exp{— v(3*(x)}a(dx), 

J (— oo,s] 

where (3*(x) = (3(x) + k(T, x) £"=i uf 

(b) The set of fixed jumps = {s\^ h \ . . . , Sm'"'} is formed by all dif- 
ferent {s\ h ^} with r- , j = l,...,m, the number of s[ h ^ = s* for 
i = 1, ... ,n. 

(c) The distribution of the fixed jumps La{s*^} is 

/if = G (rf\f}{sf h) ) + k(T, ut 1 ^ ■ 

2. Generate sf 1 ' from p(si\t n , u n< ' h 1) ,L <y A ) for i = 1, . .. ,n given by 

p( SJ |^^n" (h - 1 ^4 h) )ocA : '(^ i , s 0^ ) (^)W 1 )( s ^ )• 
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3. Generate u\ from p{ui\t 



s n , L A ) for i = 1, . . . , n given by 



p(ui\t n ,s 




Remark. In order to simulate from the continuous part of the posterior 
Levy process L A (s), one option, which we employed, is to use the Fergu- 
son and Klass (1972) algorithm. An alternative is the inverse Levy method 
adopted by Wolpert and Ickstadt (1998). Both rely on approximations, mak- 
ing finite an infinite number of jumps. See Walker and Damien (2000) for 
the ideas. 

3.1. Numerical example. Let us consider the case in which La = Ta and 
k(t,x) = - [1 — exp{— a(t — x)}]Im t j(x) (a 6 M + ), where I denotes the indi- 
cator function. This kernel has been motivated and used by Nieto-Barajas 
and Walker (2004). Thus F is an MDP or, better, a Dirichlet driven random 
probability d.f. of the form 



In this case one easily verifies that the arithmetic mean J£°tdF(t) can be 
written as f£°(x + -) dS>A{x); hence, its distribution is that of / °° x d2>A{x) 
shifted by the factor 1/a. The posterior density function of the mean, hav- 
ing observed T n = t n , is given by a slight modification of (8). Since the 
expression is difficult to deal with, we resort to our simulation algorithm, 
having set a(dx) = 1^5] (x) dx and a = 2. We simulated n = 100 data points 
from a £7(1, 1). Recall that the jumps of an IAP, when using the Ferguson 
and Klass algorithm, are simulated in a decreasing order according to their 
size. We truncated the number of jumps by calculating the relative error of 
a new jump and keeping only the jumps whose relative errors are greater 
than 0.0001. We ran the Gibbs sampling for 10,000 iterations with a burn-in 
of 1,000, keeping the last 9,000 simulations to obtain posterior summaries. 
Figure 1 presents the prior and posterior estimates of the normalized in- 
creasing process F. The prior estimate is placed away from the true d.f. and 
the posterior estimate follows very closely the true d.f., as expected. 

In Figure 2 we can observe the prior and posterior distributions of the 
mean for g(t) = t. Due to the fact that the data were generated from a 
£7(1, 1), the true value of the mean is 1. The prior distribution of the mean is 
situated away from the true value of the mean and has a large variance. The 




and its corresponding random density is given by 



/(*) 
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Time 

Fig. 1. Prior and posterior estimates of the d.f. ( — — ) True d.f., (■ ) prior 

estimate, ( ) posterior estimate. 

prior expected value of the mean is 2.74. On the other hand, the posterior 
distribution of the mean has a small variance and is concentrated around 1. 
The posterior expected value of the mean is 1.05. 

APPENDIX 

Details for the determination of conditions (I) (III). We have to show 
that, under (I)-(III), the sample paths of F := {F(t) = Z(t)/Z:t€ R} are 
random probability d.f.s a.s. [P]. 

Let us start with the denominator. We have to guarantee that < Z < 
+oo a.s. [P]. Supposing J R lim t ->+oo k(t, x) dLj^{x) is finite, we have Z = 
J R lim t ^ +00 k(t, x) cILa{x), a linear functional of a reparameterized IAP. Hence, 
Proposition 1 in RLP applies, leading one to state that Z is finite a.s. [P] if 
and only (II) holds. 

Consider now the problem of the a.s. [P] positiveness of Z. Notice that, 
if (II) holds, we have 

exp< — / [1 — exp(— Xv k(x))]D a (dx dv) > 

I JRx(0,+oo) J 
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\ 




12 3 4 

Mean 

Fig. 2. Prior and posterior distributions of the mean. ( — — ) Exact prior distribution, 
(■ ) simulated prior distribution, ( ) simulated posterior distribution. 



P{Z = 0} + E[e 



(0, + oo) 



where I denotes the indicator function. By the monotone convergence theo- 
rem, P{Z = 0} = lim x ^ +00 exp[J_ Rx ^ +00 - ) [l-exp(-Xvk(x))]i> a (dxdv)}. This 
entails that P{Z = 0} = if and only if lmiA->+oo Jr x (o +oo)[1 — ex P( — Avfc(x))] x 
D a (dxdv) = oo. Finally, we again apply monotone convergence so that P{Z > 
0} = 1 if and only if / Mx{0 +oo) v a (dxdv) = +oo. 

Turn to the numerator. In order to guarantee 1 1— > Z(t) to be nondecreasing 
and right continuous it is enough to suppose 1 1— > k(t,x) to be so for every 
i£M. Furthermore, if lim^-oo k(t, x) = 0, we will have limj^-oo Z{t) = 0. 

Proof of Proposition 3. (i) Let us start from the denominator. For 
every s£l, 



E 



expfis / k(x) dr^(x) 



exp 
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exp 



/. k{x) 
1 -expUsu— r- 

Rx(0,+oo)V V 



exp(— ?; J 



cL4(x) 



exp is 



k(x) 



dT A 



m 0{x) 

Applying the same arguments to the numerator, (i) follows. 

(ii) The relation follows by application of the same arguments as in (i). 

(iii) Follows immediately by application of Fubini's theorem. □ 

Proof of Theorem 1. In order to derive the posterior distribution, 
given T n = t n , we start by discretizing the MDP according to the proce- 
dure outlined in Section 2.2. The discretized random mean, at any fixed 
level m of the tree of nested partitions, will be of the form f^g(t) dF m (t) = 
JiEj=o +1 g(am,j)[H t m,j+i,x) - k(t m j,x)]d^ A (x) and hence its d.f. can be 
written as 



(9) 



— / -Im< exp< — / log 
vr Jo s [ [ Jr 



/fc+i 

1 +iS I H r m,j(k(t m j + l, 
\j=0 



k(t m j,x)) - a 



x a(dx) > >ds, 



where r m j = g(a m ,j), for j = 0, . . . , k m + 1, and Imz stands for the imaginary 
part of z € C. Moreover, recall that 



la (n-1)! 

is the Liouville-Weyl fractional integral, for n > 1, and J° + represents the 
identity operator. By applying (5) to (9) together with some algebra, one 
obtains that its posterior density, given T n = t n , can be represented as 

-l) q+1 

-!■ FZ x Jmil> m {<T), Hn = 2q, 

pA°) = { f _ iVi 

^ I^Re^cr), ifn = 2g + l, 



7T 



with 
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hoo 

x / s n_1 / expl — / log 



/fc+i 

1 +iS [ ^Z r j( k ( t m,j+l,x) 
\j=0 

— k{t m j ,x)) — <T 

x a*(dx) 



X Y[[k(t mt i p+ i,U p ) - k(t mt i p ,U p )] 
p=l 

x a n {du\, . . . , du n ) ds, 

where a n is the re-fold product measure nfc=i( a + J2i=i ^uj and &* is given 
by a + Sr=i^Mi- I n this case the expression for fj, n (t n ) is known, since it 
follows immediately by repeated application of Lemma 1 in Lo (1984), 

^(t n )=(f[(a(R) + i-l)) 



(11) 

x 



i=l 

ii 



/ ~[[k(tm,i p +i,u P ) - k(t m .i ,u p )]a n (dui, . . . ,du n ). 

From (4) we know that (10) can be used as an approximate posterior density. 
Nevertheless, in this case we are able to obtain an explicit representation of 
the limiting posterior density. 

Division of both numerator and denominator by Ilp=i [^"v P +i —t m ,i p ] and 
application of the dominated convergence theorem yield the limiting poste- 
rior density which is given by (10) with 

= n?=i(«w+^-i) 

fun Up=i k'(t p , u p )a n (du 1 , du n ) 

(12) x p/" 1 / exp(- / \og[l + is{h{x) -a))a*{dx)\ 

Jo it" I Jm. J 

n 

x k'(t p , u p )a n (du\, . . . , du n ) ds. 
P =i 

Note that by Scheffe's theorem we have also convergence in total variation 
of ¥ m j n to Fjn . By application of Fubini's theorem it is possible to rewrite 
the posterior density function as 

Ilp=i k'(t p , u p )a n (dui, du n ) 



(13) pt«(cr) = / p U "(cr) 



Jr Up=i k'(t p , u p )a n (du 1 ,. . . , du n ) ' 
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where, if n = 2q, 

(-i) g+1 n? = iHiR)+»-i) 

7T 

x I™" 1 ^ + °° s 11 - 1 Im(exp{- J log[l + is(h(x) - a)} 

x a*(da;)|j ds, 

while, if n = 2q + 1, p u ™{a) is obtained by simply substituting Im with Re 
in (14). Indeed, p u n is a posterior density, given U n = u n , of a mean of a 
Dirichlet process, precisely of J^h(x) d^A(x)- This can be seen by applying 
the procedure for derivation of posterior distributions of normalized RMI 
in Section 4 of RLP. Given the conjugacy of the Dirichlet process, we can 
replace (14) with the simpler expression 

(15) Pu n(a) = ^J Re^expj-^ log[l + is(h(x) - a)]a* (dx)^ ds. 

Thus one has that a mean of an MDP, resulting from the combination of 
(13) and (15), is a mixture of a particular mean of a Dirichlet process, given 
the latent data U n . By (11) it is easy to identify the mixing distribution as 
the distribution of U n conditionally on the real observations T n . □ 



Pu" (0"J 



(14) 



Proof of Proposition 4. The idea of the proof is to express the 
likelihood function in a tractable way so we are able to apply standard 
Bayesian updating mechanisms. Let T\ =t\ be a single observation from 
F, and let Si = si and U± = u\ be auxiliary variables. Then the likelihood 
function is given by 

lik(L J 4|ti, s\, u\) = expj— u\ J k(T, x) dL^(x)|fc'(ti, si) dL^si). 

Using product-integral properties [see, e.g., Gill and Johansen (1990)], the 
likelihood function can be rewritten as 



lik(L^|ti,si,ui) 



exp{— u\k(T, x) dL>A_{x)} 



xG[0,oo) 



k'(tx,s 1 )dL A (si] 



Following Dykstra and Laud (1981), the prior process La{-) can be char- 
acterized by dL A (u) ~ Q(da(u), /3(^)) for the continuous part and L{tj} ~ 
f T . (x) for the prior fixed jumps. Based on the independence between in- 
crements in the prior process, the posterior conditional distribution for the 
continuous part and for the prior fixed jumps come straightforward. The 
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only remaining point, to establish completely the posterior conditional dis- 
tribution of La(-), is the distribution of the new fixed jump at s\. For this, 
let 

dL A (si) = L A [si,si + e), 

and then 

p(LA[«i,ai + e)|ti,si,ui) L A [si J ai+e) 0l[ * 1,sl+e) e-f ulfc ( T ' 81 ) + ' 3 (' 1 )> LA [' 1 '» 1+B ). 
Taking the limit as e — ► 0, we finally obtain that 

p{L A {8l}\h, St, Mi) OC e -Wfc(T, S l)+/3( S i)}L.4{ S l} ; 

as stated in the proposition. □ 
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